Corpus: scn_wikipedia_2016_10K

Other corpora

5.1.18 Words nearly always as next neighbors

Strong NN co-occurrences with a low probability of being separated

The quotient below is calculated as freq(word1)*freq(word1)/NN_freq^2.

Word 1 Word 1 Frequency of word 1 Frequency of word 2 Frequency as NN Qoutient
rilativa catigurìa 96 95 89 1.15
l'elencu cumpretu 89 93 89 1.04
Stati Uniti 51 49 43 1.35
Courtesy http://digilander 32 32 32 1.00
S'hai traduciutu 25 30 25 1.20
fàrila canùsciri 28 29 25 1.30
WikiLettera St'artìculu 24 26 24 1.08
traduciutu n'artìculu 30 25 25 1.20
Metropulitana di Palermu Linea 3 4 3 1.33
Cing Mun 4 4 4 1.00
Jean-Jacques Rousseau 4 4 4 1.00
Settimiu Severu 3 4 3 1.33
Proteles cristatus 3 4 3 1.33
hatturi produttivi 3 4 3 1.33
Catigurìa 2014-2015 4 3 3 1.33
Bilingual Anthology 3 3 3 1.00
Can't Beat 3 3 3 1.00
Ban Ciau 3 3 3 1.00
Moral Fables 3 3 3 1.00
Foo Fighters 3 3 3 1.00
56 msec needed at 2018-01-18 04:11